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ABSTRACT 


Artificial Neural Networks offer an efficient platform for condition monitoring strategies 
in machinery and plants where the number of components and processes are too many 
and complex to be mathematically modeled appropriately. A machine diagnosis scheme, 
generally makes use of a knowledge based technique to identify the reasons for departure 
of the machine from its normal behavior. A condition monitoring activity is also 
intricately dependent on the quality of instrumentation and efficient data processing. The 
present study aims to integrate the fast emerging Virtual Instrumentation procedures with 
techniques of neural networks for fault identification in rotating machinery. The emphasis 
is on development of automated and user-interactive procedures for data acquisition as 
well as fault diagnosis. Experiments have been conducted on a Machinery Fault 
Simulator rig. Various types of faults were introduced and vibration data acquired from 
multiple stations on the rig were processed and employed to train the neural networks. 
The networks were based on back-propagation and probabilistic procedures. Neural 
Network training was carried out with the chosen architecture till desired degree of 
convergence is achieved. The network is finally tested and validated for test data. 
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CHAPTER 1 


INTRODUCTION 


Rotordynamics has been an area of extensive research during the past two decades. 
Experimental and analytical techniques, both have been routinely used for diagnosing 
faults in operational rotating machinery. These techniques are also used during the design 
and development processes. Condition monitoring of rotating machinery is essential in 
order to ensure operation within design constraints and anticipate problems in time, so as 
to prevent complete failure. Advances in data acquisition techniques and the 
developments in the areas of knowledge acquisition, processing and progr ammin g offer 
potential for the application of automated strategies for machinery process supervision 
which would permit early detection of faults than is possible through conventional limits 
and trend checks. Meaningful and more efficient strategies for condition monitoring of 
rotating machinery are a pressing need, due to increasing demands on reliability and 
safety for power generating systems. Vibration analysis has proved to be a powerful tool 
for analysis of failure in rotating machinery. Vibration analysis with appropriate 
instrumentation can provide reliable knowledge of machine condition and allows 
relatively inexpensive repair on a pre-scheduled controlled basis in comparison to 
catastrophic failure whose repair may cost much more. 

There are three phases in the evolution of on-line computer-based vibration monitoring of 
rotating machinery. The first phase includes collecting the vibration data and other 
operating parameters that are relevant. Phase two consists of identifying the complicated 
but normal behavior of the system, accounting for the variation of operating parameters. 
In phase three, use is made of knowledge based systems (expert systems) to identify the 
reasons for departures from normal behavior. Artificial Neural Networks (ANN) also find 
their role in the third phase of a monitoring strategy. Condition Monitoring implicitly 
involves data acquisition and analysis. Virtual Instrumentation is a recent technology that 
offers vast potential for easy and economic data acquisition and analysis. 
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The objective of the present study is to integrate techniques of Artificial Neural Networks 
with Virtual Instrumentation hardware and software to carry out experimental 
investigations on a Machinery Fault Simulator rig on fault identification. The Rig can be 
used to simulate a variety of machinery faults in a controlled maimer. Vibration signals 
are picked up on the rig from multiple stations, using accelerometers or proximity pick- 
ups. The signals are conditioned and transferred to a computer. An attempt has been 
made to automate most of the signal processing activities by writing suitable codes in 
different software packages. Online data acquisition, time domain data display, Fast 
Fourier Transformation and frequency domain display was carried out using Virtual 
Instrumentation in LabVIEW. Feature extraction from the frequency domain data has 
been carried out in M ATLAS. Data base creation, neural network training and testing has 
been carried out using Artificial Neural Network (ANN) toolbox available in MATLAB. 
Simulation and data acquisition has been illustrated for eighteen faults which include 
individual faults as well as in combination with one another. Out of these six cases were 
selected for development of the ANN scheme. Training is carried out, in each case with 
one himdred sets of vibration data from four channels. Back-Propagation and 
Probabilistic Neural Network (PNN) have been employed. Post training simulation for 
ten sets of vibration data for each fault showed a 100% correct diagnosis for both 
networks. 

A brief review of commonly occurring faults in rotating machinery, their symptoms, and 
fault diagnosis procedures are discussed in Chapter 2. The basic principles and common 
algorithms of ANN are discussed in Chapter 3. Chapter 4 describes the experimental set- 
up and data acquisition. Feature extraction, network training and validation are covered in 
Chapter 5. Conclusions and scope for future work are listed in Chapter 6. 



CHAPTER 2 


FAULT DIAGNOSIS IN ROTATING MACHINERY 


The commonly occurring faults in rotating machinery and currently available strategies 
for fault diagnosis and expert systems for fault diagnosis are discussed in this chapter. 

2.1 Common Faults in Rotating Machinery 

Rotordynamics has been an area of extensive research during the past two decades. 
Reference can be made to the studies by Sohre (1991), Rao (1992), Childs (1993), 
Dimentberg (1988) for a comprehensive discussion in the field, including fault 
classification and diagnosis. The most common problems are generally related to ABC 
Category (Eisenmann, 1998) - Alignment, Balance and incorrect Clearances (typically on 
bearings). The most prominent types of rotating machinery problems are disciassed below 
^ong with the causes and symptoms. 

U abalance 

Rotor unbalance is the major problem in most rotating machinery. Unbalance forces arise 
^iue to eccentricity between the mass and geometric centers caused by material, 
itianufacturing defects or by damage during operation. Vibration due to imbalance 
prominently occurs at a firequency equal to the rotor speed. However, specific unbalance 
forces caused as in the case of ball bearing wear give rise to additional characteristics like 
high frequency signals corresponding to bearing vibration firequency. 

Mi salignment 

Misalignment is present in almost every machine to some extent. During the course of 
operation a email degree of misalignment may attain higher proportions. Misalignment 
oan be classified as parallel or angular. It is a major source of vibration and leads to other 
problems like bearing overheat/damage, loss of power etc. Frequencies are 



characteristically twice the running speed, but first and sometimes third or fourth 
multiples of running speed are also observed. 


Rolling Element Bearing Defects 

Rolling element bearings contain elements that roll between an inner race and an outer 
race vdth very close clearances. They provide high stiffness to the rotor but are low in 
damping. Application of high loads, shock loads, or bearing attrition due to extended run 
times increase the internal clearances resulting in bearing failure. Typically four types of 
defects are observed in rolling element bearings, each giving rise to vibrations at a 
specific frequency. Reference can be made to Eisenmann (1998) for a detailed discussion 
on bearing problems. Briefly the faults and the predominant vibration frequencies can be 
sximmarized as 

Type of Defect Frequency 


Outer Race Defect 

Inner Race Defect 
Ball Spin Defect 

Train/Cage Defect 
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where, 

N = Number of contained balls or rollers 

RPM = Rotational speed of bearing inner race (Hz) 
Dball = Diameter of ball or roller 
Dpitch = Pitch diameter of balls or rollers 
P = Bearing load contact angle (degrees) 


4 



Cocked Rotor 


A rotor with its plane not perpendicular to axis of rotation is termed cocked. Also, 
unevenness in mass distribution on either side of the plane perpendicular to rotational 
axis and passing through geometric center of the rotor produces cockiness. This fault 
generates excessive vibration in axial direction prominently at the first multiple of 
rotational firequency. Sometimes, second or third multiples of rotational fi'equency are 
also observed. 

Rotor Rub 

Contact of any mechanical element of the system with the rotor causes rubbing. It can be 
radial or axial. Mechanical seal rub is a typical example of radial rub and casing rub at 
shaft end is a case of axial rub. Mostly both the types of rubs generate vibration at 
frequency equal to twice the rotational frequency but lower and higher frequencies are 
also common. 

Distortion 

Casing and foundation distortion cause vibration in an indirect way, either by generating 
misalignment between driver and driven machines or by causing internal rub or uneven 
bearing contact. This in turn transmits forces to the rotor, inducing it to generate forces of 
its own, such as unbalance and a wide variety of oil film and friction induced forces. 
Another possibility is that the loads on casing supports shift and can set off a series of 
resonance problems. Piping forces and foundation distortion often cause this type of 
difficulty. 

Asymmetric Shaft 

The response of the asymmetric shaft has several harmonics and the frequencies observed 
are first, second or third multiples of the rotational frequency. 

Mechanical Looseness 

Loose rotor components such as disks, sleeves, thrust collars etc. cause internal friction 
problems. The frequency of vibration is always the rotor critical speed. Mechanical loose 
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components like bolts give rise to lx rev and harmonic frequency signals due to 
secondary phenomena. The amplitude and phase continually change. Loose assembly of 
bearings give rise to sub harmonic response and the typical frequency response is 
0.5xrev and (l/3)xrev. This could be mistaken for oil film whirl, particularly in the 
region of twice the rotor critical speed, around the threshold of instability. 

Oil Film Whirl 

OU whirl m an oil lubncated journal bearing can occur due to preload forces, 
shafVbearing conditions, shaft eccentricity/concentricity or initial rotor deflection. In 
such an eventuality a flowing wedge of oil forms in the bearing and drives the shaft ahead 
of its forward circular motion within the bearing clearance. Theoretically, the fiequency 
of response of oil whirl should be at 0.5 x rev. However, it occurs around 
0.45 - 0.48 X rev . 

Bending Resonance 

Bending critical of the rotor occurs when the rotational speed is equal to its lateral 
resonance frequency. Resonance of the structure, support and auxiliaries cause fairly 
large amplitudes of vibration at the rotor speed and this occurs over a narrow range of 
speed of operation. Such resonance can be cured by the addition of damping. 

Gearbox Defects 

Gearboxes generate additional frequency components equal to running speeds of various 
shafts associated with them. Also, gear teeth meshing frequencies are prominent. Defects 
in any of the shafts show frequency components as multiples of the rotational frequency 
of that shaft. Gear teeth damage^reak off show higher amplitudes at gear teeth meshing 
frequency. 

A summary of the rotating machinery defects, characteristics and common remedial 
measures are given in Table 2.1. 
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Table 2.1 Identification and Correction of Malfunctions in Rotating Machinery 
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2.2 Diagnostic Techniques 

Experimental and analytical techniques, both, have been routinely used for diagnosing 
faults in operational rotating machinery. These techniques are also used during the design 
and development processes. Parameter identification techniques also find wide use to 
quantify natural frequencies, mode shapes, damping stability and frequency-response 
characteristics. Some of these diagnostic techniques are discussed below. 

2.2.1 Time History or Waveform Analysis 

Data is sampled at specific time intervals and plotted as a function of time. This gives a 
realistic view of the actual dynamic behavior of a machine. However, the data is normally 
too difficult to analyze in such form as signals are not always periodic and are mostly 
polluted by noise. Noise can be reduced by signal conditioners but cannot be eliminated 
altogether. An important feature of waveform analysis is its ability to interpret non-steady 
conditions as observed in corhpressors, pumps, hydraulic motors, machine tools etc. It is 
possible to extend the use of waveform analysis to study of phase changes and short 
transient impulses. Valuable information is provided by signal shape i.e. truncation, 
pulses, modulation, glitch or shaft-induced signals obtained from a proximity probe that 
are caused by scratches on the shaft. Time between events represents the frequency 
component specific to the machine. The phase between two signals provides information 
about vibratory behavior that can be used to diagnose a fault such as a misalignment and 
mass unbalance. 

2.2.2 Orbital Analysis 

Signals obtained from two probes mounted perpendicular to each other in a single 
vertical plane perpendicular to the shaft, when plotted on two mutually perpendicular 
axes, generate an actual trace of a shaft’s rotation, called the orbit. Orbital analysis is 
employed to diagnose problems like shaft pre-loading. For example, a horizontal pre-load 
(e.g. misalignment) restricts horizontal motion and will produce an elliptical orbit with 
the vertical axis longer than the horizontal. Other defects like torque reaction, pressure 
angle forces associated with gearboxes, internal seal mb due to casing distortion, shaft 
mbs etc. can also be studied through orbital analysis. 
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2.2.3 Spectrum Analysis 

Spectrum analysis is conducted with an FFT algorithm or filters. The processed signal 
gives information about the amplitude and phase content at various frequencies. 
Frequencies of vibration response can be related to direct excitation frequencies or their 
orders, natural frequencies, sidebands, subharmonics and sum-difference frequencies. 
Frequencies of the measured vibration response are equal to those of the forces causing 
the vibration. The operating speed of the machine is usually the fundamental frequency 
used in the analysis. The peaks at multiple orders of the fundamental are due to a variety 
of faults that are present in the rotating machinery. The exact fault can be predicted by 
comparing with good test data that is directly proportional to the information available on 
the design of a machine and its working mechanisms. 

2.2.4 Cepstrum Analysis 

Cepstrum is defined as the inverse Fourier transform of the logarithmic power spectrum 
commonly used in vibration analysis. The complex cepstrum is reversible to a time 
signal. A cepstrum is effective for accurately measuring frequency spacing, harmonic and 
sideband patterns in the power spectrum. One component of the cepstrum represents the 
global power content of an entire family of harmonics or sidebands. The severity of a 
defect such as that in a rolling element bearing is provided by one component of the 
cepstrum display. 

2.2.5 Expert Systems 

Expert systems are programs that use inference techniques involving formal reasoning 
normally performed by human experts. These systems can advice, diagnose, analyze and 
categorize based on previously defined knowledgebase. The system usually consists of 
two parts, the data or knowledge base and the interference engine, which is the diagnosis 
and advisory part. Usually the knowledge base consists of two parts, viz., the working 
memory and the rule-base. The rule-base does not change during a given consultation; 
thus it is a static database. The working memory consists of facts related to a specific 
consultation and it is thus a dynamic database. The dynamic memory is empty at the 
beginning of a diagnosis. During the diagnosis process, the interference engine uses the 
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facts and rules of the rule-base in conjunction with the data from the observed signals, to 
add facts in the working memory. Fig. 2.1 shows the principles of such an expert system 
program (Rao,1996). 



Figure 2.1 Structure of an Expert System 


The work carried out in this thesis is based on Artificial Neural Network techniques, 
which are described in the next chapter. 
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CHAPTER 3 


ARTIFICIAL NEURAL NETWORKS 


Artificial Neural Networks are inspired by the functioning of the human brain. A human 
brain consists of millions of computing elements called neurons. They communicate 
through a connection network of axons and synapses. Similarly, computation is carried 
out in an ANN through several nodes called neurons each interconnected with the others. 
Similar to the human brain ANNs learn from experience, generalize from previous 
examples to new ones and abstract essential characteristics from the inputs containing 
randomness and irrelevant data. 

ANN has been attracting the attention of scientists and technologists for long. McCulloh 
and Pitts formulated the first formal definition of a synthetic neuron model based on 
biological model in 1943. However, neural networks being massively parallel distributed 
processor, require tremendous amount of computation. Non-availability of fast and 
affordable computing power had limited its growth till the last decade of twentieth 
century. Application of ANNs to vibration and fault diagnosis are relatively few. Mayes 
(1994) applied ANN for on-line vibration monitoring of large turbo-generators. The 
investigations focussed on data processing and the use of neural networks. Elkordy, 
Chang and Lee (1994) investigated the applicability of ANNs for vibration signature 
analysis of a five-storey structure. McCormick and Nandi (1997) contemplated the 
application of ANN for real-time fault classification of rotating shafts. Vyas et. al. (2000) 
used Sohre’s knowledge base to train one each of Back-propagation and Probabilistic 
Neural Networks to identify rotor faults. 

3.1 The Artificial Neuron 

The artificial neuron was developed to mimic the first order characteristics of the 
biological neuron. An artificial neuron is an information processing unit that is 
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fundamental to the operation of a neural network. A set of inputs is applied, each 
representing the output of another neuron. Each input is multiplied by a corresponding 
weight, analogous to a synaptic strength, and all of the weighted inputs are then summed 
to determine the activation level of the neuron. Fig. 3.1 shows a typical model of a 
neuron. The three basic elements of the neuron model are described below (Zurada, 
1999). 

Synapses: Synapses are connecting links, each of which is characterized by a weight 

or strength of its own. It specifies the connection between a signal Xj at the input of the 
sample j, and a neuron k. The weighting factor is Wkj. 

Adder: Adder sums up the input signals weighted by the respective synapses of 

the neuron. The operation is similar to that of a linear adder. 

Activation Ftmction: An activation function defines the output of a neuron in 

terms of the activity level at its input. It limits the amplitude of the output of the neuron 
and introduces non-linearity into the network. The activation function is also referred to 
as squashing function since it is used to limit the output values of a neuron in a definite 
small range irrespective of the inputs to it. 


The neuron also includes an externally applied threshold function, 9^, which has the 
effect of lowering the net input of the activation function. On the other hand, employing a 
bias term rather than a threshold may increase the net input of the activation function. 
The bias is the negative of the threshold. In mathematical terms a neuron k is described 
by 


u 






kj J 


yk= 


where, x ,x •••x are the input signals, 

12 p 




(3.1) 


w , w . . . ‘H’, are the synaptic weights of 

fC 1 K/ JL Kp 


the neurons k, 


u is the linear combiner output. Ok is the threshold, ^.) is the activation 
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Figure 3.1 Model of a Nonlinear Neuron 


Fixed Input 



(Including 

Threshold) 


Figure 3.2 Modified Model of a Nonlinear Neuron 
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function and is the output signal of the neuron, v is the output from the neuron after 


summation. Equation (3.1) can be equivalently written as 






(3.2) 

(3.3) 


In equation (3.2) a new synapse has been added, whose input and weight, respectively, 
are 

xo = -1 ; Wko = 0k (3.4) 

This modified model of a neuron is shown in Fig. 3.2. 

3.2 Network Architectures 

The manner in which the neurons in a neural network are stractured is intimately linked 
with the learning algorithm used to train the network. Four general classes of network 
architectures are: 

(a) Single Laver Feed Forward Network : This is a network of neurons organized in 

the form of layers. The simplest form of a network consists of an input layer of source 
nodes that project on to an output layer of neurons, but not vice versa. This is of feed 
forward type (Fig. 3.3). 

(b) Multi-layer Feed Forward Network : It is similar to the one described in (a) above 

except that it has one or more hidden layers of neurons. Hidden layers of neurons are 
interface between the input and output layers. It is observed that some problems converge 
better when these hidden layers are used (Fig. 3.4). 

fcl Recurrent Networks : A recurrent network distinguishes itself from the feed- 

forward network in that it has at least one feedback loop. For example, a recurrent 
network may consist of single layer of neurons, with each neuron feeding its output 
si gnal back to the input of all other neurons (Fig. 3.5). A time delay may be introduced in 
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Nodes Neurons 


Figure 3.3 Single Layer Feed Forward Network 
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Figure 3.4 F®®** Forward Network with One Hidden Layer 
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Figure 3.5 Recurrent Network 
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the feedback path. Recurrent networks typically operate with a discrete representation of 
data and employ neurons with a hard-limiting activation function. 

(d) Lattice Structure : A lattice consists of a one or more multi-dimensional array of 
neurons with a corresponding set of source nodes that supply the input signals to the 
arrays (Fig. 3.6). The dimension of the lattice refers to the number of dimensions in 
which the graph lies. 

3.3 The Back-propagation Training Algorithm 

The Back-propagation Training Algorithm (BP A) is a popular training algorithm. Its 
major attraction is its suitability to a large number of applications and high rates of 
convergence. It is designed to solve the problems of choosing weight values for layered 
artificial neural networks with feed forward connections from input layer to hidden layer 
and then to the output layer. The algorithm performs the input to output mapping by 
minimizing a cost function using a gradient search technique. The cost function, which is 
equal to the mean squared difference between the desired and the actual net output, is 
minimized by making wide coimection adjustments according to the error between the 
computed and target output processing element values. There are two stages in the 
development of a back-propagation algorithm, namely forward pass and backward pass. 
During the forward pass all the weights of the network are initialized randomly and the 
network outputs and the difference between the actual and target output (i.e. the error) is 
calculated for the initialized weights. During the backward step, the initialized weights 
are adjusted to minimize the error by propagating the error backwards. The network 
outputs and error are calculated again with the updated weights and the process repeats 
till the error is acceptably small. These two steps are described below and the whole 
algorithm can be represented schematically as shown in Fig. 3.7. 

Forward Pass 

Referring to Fig. 3.7 the pth input vector to the network can be described as 
^p = [^pl --^pn)' 

where represents the input attribute i for the vector p. 
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Figure 3.7 Three Layer Back-propagation Network Architecture 


The net input to the hidden layer then becomes 


N 


net 


PJ 


/=! 




(3.6) 


where, w*, represents the weight of the layer h from node i to node j and represents 
the threshold for the node L of the layer h. 

The outputs from the hidden layer (which is the input to the output layer) are 



(3.7) 
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is the output from the node j of the hidden layer h, is the activation function at 
node j of the hidden layer h. The equations (3.6) and (3.7) are represented graphically in 
Fig. 3.8. 



Figure 3.8 Output Calculation at Each Node in the Forward Pass 


In the above the net-input values at output layer unit are 
net° = . +0^^ 

pk kj pj k 

while the outputs at output units are 

Individual error at each output unit is 

^pk ~ y pk ~ ^pk 

from which the overall mean square error can be computed as 


1 ^ 7 

E , 

P = \ 


(3.8) 

(3.9) 

(3.10) 

(3.11) 
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Backward Pass 


Weight adjustment is carried out at the output layer through the following procedure. 
Using equations (3.10) and (3.11), the mean square error, Ep, can be expressed as 

E, =i 2 ;(F,. -o;J =iz{yU -Ainefjf ( 3 . 12 ) 

The weight change of an output layer weight is the negative gradient of Ep with respect to 
output layer weights and can be written as 


dE^ 


d{net;,) 


df° d{net;,) 


pk) 


However, from equation (3.8) 


dw° 

hi 


dw° 

kj 


(l</ .+0p)=i . 

V kj pj k / pj 


and from equation (3.9) 




(3.13) 


(3.14) 


(3.15) 


Therefore, the weight change at the output layer weight is 


^ = {ypk -o°p, )/; {net )x i^j 
Now denoting 

Kk={ypk -o°p,)n{nefp,) 

the weight change at the output layer weights can be written as 


(3.16) 


(3.17) 


(3.18) 
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To make the learning process smooth and to ensure that the weight changes take place in 
the same direction, two network parameters — learning rate coefficient t] and momentum 
a. — are introduced in lieu of direct application of the above mentioned weights, so that 

wl,{t^-\)=wl{t)+tj{y,,-o°^)ff{nell,)xi^+ccwl{t-\) (3.19) 

A small value of 7 implies that the network will have to make a large number of 
iterations. Its value is normally kept between 0.05 and 0.9. It is often possible to increase 
its value as the network error decreases, thereby increasing the speed of convergence. 
Another way to increase convergence speed is by adopting an extra momentum term 
while updating the weights. This additional term tends to keep the weight changes in the 
same direction. The entire weight updating process at the output layer can be represented 
as shown in Fig. 3.9. 

Hidden Layer Output Layer Target 



Figure 3.9 Output Layer Weight Updating 
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While updating the weights for the hidden layers it should be noted that there is no target 
output and therefore the adjustment of weights is proportional to their initial contribution. 
From equations (3.8) and (3.9) one gets 


E, +^;)f 


(3.20) 


The weight change of hidden layer weights is the negative gradient of Ep with respect to 

h 

hidden layer weights w and is given by 


BE ( \ do° d{net° \ 8i , dinet^ ) 

P _ -n° I \ pkJ pj \ pj J 

^Vpk °pkK 


dw 


k^Pk P'^'d{net° ) di dinet^ ) dw^ 

^ nk ^ Dl ^ ni ^ fi 


(3.21) 


J‘ ' pk ' PJ ' PJ ' ji 

and the individual terms on the right hand side of the above equation can be expanded as 


So" d{ /;( neC ) ) 


S( net" ) a( neC ) 


= //'(««;. ) = o%{^-o;, ) (3.22) 


s{net",t) 




Vj-1 


di . 

PJ 


di 


< 


PJ 


(3.23) 


di^ 


PJ 


d{fj net ) ./,■/ H ) = 


a( netl^ ) a( net^ ) 


PJ 


(3.24) 


y V 


d[netlj) pi 


dw^.. 

Ji 




''P^ 


(3.25) 


Substituting equations (3.22)-(3.23) in equation (3.21) one gets 
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(3.26) 



Network parameters 77 and a can be introduced in a manner similar to that in the case of 
the output layer, to express the final weight change at the hidden layer as 


^.(r + l) = w^.(0+^^ k-o\k{y +aw^(t-l) (3.27) 

pi PJ ^ PJ ' \ pk pk / pk\ ok I ki ii ^ ^ 


The entire weight updating process at hidden layer can be schematically represented as 
shown in Fig. 3.10. 



Figure 3.10 Hidden Layer Weight Updating 
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3.4 Types of Activation Functions 

Various types of transfer functions, are employed in developing a neuron. Some of 
the common transfer functions are described below. 

Name 

• Linear Activation Function 

• Positive Linear Activation Fxmction 

• Hyperbolic Linear Tangent Sigmoid 
Activation Function 

• Logistic Sigmoid Activation Function 

where /?is a constant, and <fi{.) is the activation function. The above four activation 
functions are represented graphically in Fig. 3.11. 


Mathematical Representation 
(p(x) =x 

(p(x) =x for x> 0, 

= 0 for x< 0. 

2 


(p{x) = 


1 + exp(- px) 


-1 


(p{x) = 


1 + exp(- px) 


3.5 Training Algorithms 

Training is accomplished in a network by sequentially applying input vector, while 
adjusting network weights according to a predetermined procedure. During training, the 
network weights gradually converge to values such that each input vector produces the 
desired output vector. Neural network toolbox in MATLAB provides fifteen different 
learning functions. Out of these two were used in the current study and are briefly 
described below. 


Gradient Descent Back-propagation ('‘TRAINGD’') 

TRAINGD updates weight and bias values according to gradient descent as follows 
div = 77xd£ /dw (3.28) 

p 

where, w is the weight/bias variable, ^is the learning rate, and Ep is the performance. 
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This training algorithm updates weight and bias values according to the resilient back- 
propagation algorithm as follows 

dw = Aw X sign (gw) (3 .29) 

where, the elements of Aw are all initialized to A) and gw is the gradient. At each 
iteration the elements of Aw are modified. If an element of gw changes its sign 

between one iteration and the next, then the corresponding element of Aw is decreased 
by certain value. If it maintains the same sign from one iteration to the next, then the 
corresponding element of Aw is increased by certain value. This continues change in 
value of Aw ensures a gradual but faster convergence. 


3.6 Probabilistic Neural Network (PNN) 

Investigations were also carried out, during the present study on the suitability of 
probabilistic neural networks for fault identification. Probabilistic Neural Networks 
(PNNs) find their application mainly in classification problems (Bose and Liang, 1996). 
Some of the salient features of such networks are briefly described here. 


A Probabilistic Neural Network bases itself on Bayes’ Rule which describes the 
probability of the presence of a particular fault conditional to the observation of a certam 
symptom as 



p(eIHi) p[Hi) 


n = 1 


where 


(3.30) 


p{h^ / e) the probability that fault Hi is present, given a symptom E 

P{E / Hi ) probability that symptom E will be observed when fault H, 

is present in absence of any specific symptom 
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the apriori probability that fault H is present in absence 

i 

of any specific symptom, 
number of faults 


The following computational procedure can be followed. 

•^(^/ ) Information generally obtained fi:om machine history, 

otherwise taken as unity for all possible faults 

X p{e / H ) This term in the denominator is constant for all i 

77=1 " 


or 


Given a symptom E, the network computes the conditional probability p\ H / £ ] , of 
the presence of faults (/ — 1,2 ... k). A comparison is then made: 

if pi^H_ / £ j > p[hj I e] for all then the fault is H. , or 

if p[eIH^ > p{eIH ^ for all j ^ / then the fault is H.. 




is computed using the following 


^ ^ I I 


(£ - {E - e\) 

2cr^ 

I 


(3.31) 


where 

m number of observations e in every training symptom pattern E 

n. number of training symptom patterns pertaining to the i* fault 

<^1 smoothing parameter computed using radial basis functions 
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The probabilistic neural network, in addition to the input layer, has two hidden Layers 
and an output layer (Fig. 3. 12). Its major difference with a back-propagation network is 
that it can be constructed after only a single pass of the training data sets. Also, the 
activation function is statistically derived from estimates of the probability density 
functions based on training patterns. 


Feature Space Number of Number of Selected 

Dimension Exemplars Classes Class 



Figure 3.12 Probabilistic Neural Network 


Fault simulation on the rotor rig, data acquisition and processing and network training 
and validation are described next in chapters 4 and 5. 
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CHAPTER 4 


EXPERIMENTAL SET-UP AND DATA ACQUISITION 


The experimental work was carried out on a commercially available rotor rig. Several 
faults were deliberately introduced in the rotor assembly. Vibration signals were acquired 
from a number of appropriate stations on the rig, through Virtual Instrumentation 
techniques of Lab VIEW software. The vibration signals were processed and fed to the 
neural network simulator which was developed using MATLAB toolboxes. 

4.1 Machinery Fault Simulator 

The experimental rotor rig (Machinery Fault Simulator, make Spectraquest, type 2) 
consists of a basic framework involving a shaft supported in bearings and driven by a 
D.C. motor (Figs. 4. 1 and 4.3). The kit also carries a reciprocating mechanism, which can 
be driven through a gearbox. The gearbox, in turn is driven by a belt drive consisting of 
two V-belts. A large number of rotating machinery faults can be introduced in this MFS 
and realistic dynamic conditions under rotation can be created. MFS is modular in design 
to facilitate easy removal/replacement of components. Additionally, the kit cames a 
number of components with deliberately introduced faults. This facilitates introduction of 
faults in the rotor assembly in a controlled manner. The present study was restricted to 
common faults in rotating shafts, belt drives and gearboxes. Faults in reciprocating 
mechanisms were not simulated. The rig has an adjustable speed range of 0 - 6000 rpm 
(0-100 Hz). Sweep rate adjustment in this speed range is also possible in order to 
simulate coast-up/coast-down operations. Different loading conditions can be simulated 
in MFS by setting torque in an adjustable torque magnetic brake. 

4.2 Instrumentation 

Location of sensors is crucial to any instrumentation scheme. In a rotating machinery, 
shafts are normally more flexible in transverse direction than other rotating components 
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and shaft deflection is considered to be an appropriate measure for rotor faults like mass 
unbalance, bent shaft, rotor rub etc. Bearings form the interface between rotating shafts 
and stationary base and faults in either of these make the bearings most susceptible to 
damage. Also, vibration signals from most of the components get transmitted to bearings 
and therefore bearing housing is an appropriate location for mounting vibration sensors. 
Vibrations sensed at foundation/bed-plates also provide useful information since a 
substantial portion of energy gets transmitted through the machine foundation. 
Measurement of casing vibration provides a view to the relative motion of casing with 
respect to the shaft and free space. 

During the present study, seven sensors were used to pick up the vibration signals at the 
following locations (Figs. 4.2 and 4.3): 

1 Shaft displacement in the vertical direction at left side bearing end: Sensor - non- 
contact type eddy current displacement probes (Proximitor), model 3300 RAM, 
Bently Nevada make ; associated transducer and TK 15 signal conditioner. 

2 Shaft displacement in the horizontal direction at left side bearing end: Sensor - same 
configuration as in 1 above. 

3 Keyphasor measurement for phase information (a small plate externally attached to 
the shaft was used to produce the required trigger signal): Sensor - same 
configuration as in 1 above. 

4 Bearing housing vibration, in the vertical direction at the right side bearing: Sensor - 
accelerometer (model 4374, Bruel and Kjaer make) along with a charge amplifier, 
(model 2635, of the same make). 

5 Bearing housing vibration, in the horizontal direction at the right side bearing: 
Sensor - same configuration as in 4 above. 

6 Bearing housing vibration, in the axial direction at the right side bearing: Sensor - 
same configuration as in 4 above. 

7 Foundation vibration at its mid-span: Sensor - same configuration as in 4 above. 
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Figure 4.2 Instrumentation and Data Acquisition Devices 
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Figure 4.3 Schematic Diagram of the Experimental Set-up 



















Vibration data is obtained simultaneously at the above seven stations and acquired into a 
P-III computer through a Data Acquisition Card (AT-MIO-16E-1, National Instruments, 
Texas) and Lab VIEW software. LabVIEW, like C or BASIC is a general-purpose 
programming system and includes for data acquisition, data analysis, data presentation, 
and storage. It uses a graphical programming language, G, to create programs in a block 
diagram form. LabVIEW programs are called Virtual Instruments (FIs) because of their 
appearance, and their operation that can imitate actual instruments. An interactive user 
interface of a VI, called Jront panel is used to pass on user defined parameters into the 
program, called block diagram and program’s output parameters are displayed in the 
front panel. Since user interactive programs can be written for various applications and 
output parameters of the program can be displayed in text as well as graphical form, it 
offers tremendous potential for developing wide variety of applications. The front panel 
is used for data input in form of text box, knobs, buttons etc. and output in the form of 
text box, figure box, graphs, plots etc. Modular programming techniques make LabVIEW 
suitable for simulating a large number of instruments limited only by the data acquisition 
board used. 

4.3 Data Acquisition, Storage and Display 

A Virtual Instrument with the following facilities in a single Front Panel (Fig. 4.4) was 
developed, in LabVIEW, during the present study. 

(a) Time domain vibration data acquisition from seven channels with following user 

controls (figures/text wdthin brackets indicate default values); 

(i) Scan Rate ( 1 000). 

(ii) Number of data points to be read before display (4096). 

(iii) Device and Channel numbers from which to acquire data (Device no. 1 
and Channel nos. 0 to 6). 

(iv) Whether to use trigger for start of data acquisition (No). 

(v) Channel to be used for trigger signal (0). 

(vi) Trigger to be used on rising/falling signal levels (none). 

(vii) Trigger level (0). 
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Figure 4.4 Typical Front Panel of Data Acquisition, Display and Logging VI 


































(viii) Trigger hysteresis (0). 

(ix) Number of data points to be recorded before trigger (200) 

(b) Conversion of time domain data to Frequency domain data by Fast Fourier 
Transform (FFT) after filtering with a Hanning window with user control for 
number of data points to be used for the transformation (4096). 

(c) Root Mean Square (RMS) value evaluation of time domain data for each channel. 

(d) Option for data logging of both time and Frequency domain data of all seven 
channels in user defined file names in spreadsheet format. 

(e) The display features include the following 

(i) Graphical Time domain data. 

(ii) Graphical Frequency domain data with control for linear/log scales. 

(iii) RMS values of time domain data. 

(iv) Actual sampling rate. 

4.4 Fault Simulation 

A preliminary test was done to check the working of the Machinery Fault Simulator. A 
large number of faults were simulated on and vibration signals were obtained to make a 
visual inspection of the specific features in the corresponding Fourier Spectrum. 
However, for the purpose of development of the neural network, the number of 
considered faults was restricted to 6, in order to keep data handling at a manageable level. 
The eighteen faults simulated for the preliminary check are: 

1 . No Fault - same as factory condition of MFS 

2. 5 .2 gm mass unbalance on right rotor 

3. 5.2 gm mass unbalance on left rotor 

4. 5 mil parallel misalignment in horizontal plane 

5. 3 mil angular misalignment in horizontal plane 

6. Combination of faults at 2 and 4 above 

7. Combination of faults at 2 and 5 above 

8. Damaged inner race of right bearing 

9. Damaged outer race of right bearing 

1 0. Ball Spin in right bearing 
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1 1 . Cocked left rotor 

12. Mechanical rub on shaft near left bearing 

13. No Fault with belt drive and gear box connected 

1 4. Tight Belt with belt drive and gear box connected 

1 5 . Loose Belt with belt drive and gear box connected 

16. As in 1 3 above and mass unbalance in gearbox output shaft 

17. As in 14 above and mass unbalance in gearbox output shaft 

18. As in 15 above and mass unbalance in gearbox output shaft 

Faults (1) to (12) were simulated without the belt drive i.e. gearbox and reciprocating 

mechanisms, while the rest were simulated without the reciprocating mechanism alone. 
Vibration data was acquired and stored into the computer for each of these faults. Fourier 
Spectrum of typical vibration signals in the vertical direction picked up by the 
accelerometer at the bearing housing for each of the eighteen faults are displayed in Figs. 
4.5 (a-r). It can be seen that each fault carries jts own distinct signature in the firequency 
domain, and the location and magnitude of the peaks can serve as a useful tool in the 
reverse problem of identification of an unknown fault from a frequency domain signal. 

The following six cases were chosen, from the 18 faults listed earlier, for development of 
the Neural Network Simulator. 

1 . No Fault - same as factory condition of MFS 

2. 5.2 gm mass unbalance on right rotor 

3 . Damaged inner race of right bearing 

4. Damaged outer race of right bearing 

5. Ball Spin in right bearing 

6. Cocked left rotor 

A total of one hundred and ten sets of vibration signals were acquired for each of the 
above six cases. Each set includes signals from the seven sensors mentioned earlier. Out 
of the six hundred and sixty signals, six hundred were used for neural network training 
and the remaining sixty were set aside for testing the trained algorithm. Time domain 
vibration signals were acquired at a scan rate of 2560 samples/sec. Frequency domain 
transformation was carried out on line using an FFT algorithm. 
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Figure 4.5(Contd.) Bearing Housing Acceleration (Vertical Direction): Rotor Speed 1800 rpm 
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Figure 4.5(Contd.) Bearing Housing Acceleration (Vertical Direction): Rotor Speed 1800 rpm 







CHAPTER 5 


NETWORK TRAINING AND VALIDATION 

Input to Artificial Neural Networks is normally in matrix fonn with a column 
representing a set of data. Elements in this column are indicative of the salient features 
(expressed in numeric form) of a data set. Extraction of the significant features from raw 
data is crucial to any condition monitoring exercise. Extraction of these features from a 
vibration signal can be arduous since they are almost always contaminated with noise. 
However, in the present exercise since the rotor rig is custom made for fault simulation, 
the noise level is low and the signals did not require any treatment for noise removal. 


5.1 Feature Extraction 

Feature extraction from vibration signatures to characterize particular machine faults has 
received considerable attention from researchers. Elkordy, Chang and Lee (1994) used 
state variables computed from the strain mode shapes and displacement mode shapes for 
stmctural damage diagnosis and condition monitoring of multi-storied buildings. 
Canonical correlation analysis was used by Mayes (1994) to study oil- whirl problem and 
reactive load dependent behavior of power generator. McCormick and Nandi (1997) 
extracted time-invariant features from stationary time series by estimating zero-lag 
moments. However, these procedures consume more computational time and memory 
since the moments and cross-moments are required to be calculated. FFT is widely 
recognised as a powerful tool to characterize most of the common faults in rotating 
machinery. Frequency spectrum is most commonly used and understood by most of the 
vibration professionals. Also, frequency domain data are easy to obtain from time series 
since a number of FFT analyzers, both digital and analogue, are available off the shelf. 
However, randomness in the frequency spectrum along with shifting frequencies does not 
allow simple diagnosis. In the present work an algorithm has been developed for 
automatic detection of peaks in a Fourier Spectrum. The algorithm can take the frequency 
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domain spectrum as input and provide the amplitudes and frequency locations of the 

prominent peaks, as output. 

The peak detection algorithm addresses the following aspects. 

(a) A frequency spectrum would normally comprise of a large number of crests and 
troughs. V/hile the visual process of peak identification is subjective (dependent 
on the judgement of the viewer), a computer algorithm will require quantification 
and comparison in order to decide which of the crests on the spectrum can be 
labeled as 'peaks'. One way of quantification would be to first compute the mean 
of the signal and then setting a threshold, which can be a function of this mean. 
The next step would be comparison of the amplitude at every frequency on the 
spectrum with this threshold. If the amplitude is higher than the threshold then it 
can be said that a peak exists at the particular frequency. The algorithm developed 
in this study follows the above logic. Also, flexibility is provided to define the 
threshold as some multiple of the computed mean, in a user-interactive manner. 

(b) Searching for peaks is carried out over small frequency bands rather than discrete 
individual frequencies on the spectrum. This is done in order to account for the 
min or shifts in the amplitudes due to a variety of reasons, including windowing. 

(c) A frequency spectrum always carries a large number of peaks. Some may be 
located at natural frequencies of the machine, while others may occur at 
harmonics of the rotational speed. Search at all possible frequencies on the 
spectrum may be time consuming. A decision should be made regarding the 
specific frequencies at which the algorithm should conduct a search. 

(d) Neural networks work on the principle of multiplying various synaptic weights to 
an input data to generate the input to its activation function. The importance of all 
the peaks in this scheme cannot be identical and therefore the peak-detection 
algorithm should retain information about the relative magnitudes of the peaks. 

Automated peak detection was carried out, during the present study, for the first five 

harmonics of the rotational speed. The program was written in MATLAB, which utilises 
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the Lab VIEW data in a spreadsheet form. The Flow-Chart of the algorithm is given in 
Fig. 5.1. 


It has user interactive control for the following values; 

(a) Rotational Speed of the machine 

(b) Upper level of harmonic up to which harmonics are to be evaluated 

(c) Frequency band around the frequency of interest for determining presence/ 
absence of peaks 

(d) Whether half harmonics/ fractional harmonics are also to be evaluated 

(e) Multiple of average amplitude to be taken as a threshold for defining a peak 

(f) Type of data output required. Data output can be in five different types as listed in 
the algorithm in Fig. 5.1. This facilitates use of various types of peak detection 
data for neural networks or other applications. For this study, ratio of mean 
amplitudes of frequency bands around the peak to the mean amplitude of 
particular channels was used. 


Figs. 5.2 - 5.7 show typical sets of the actual Fourier Spectra and plots of peaks identified 
through algorithm discussed above. The plots pertain to vertical and horizontal 
acceleration measured at the bearing housing for each of the six faults under 
consideration. 

During this study, while measurements were taken with seven sensors, which included 
both proximity pick-ups and accelerometers, it was found that shaft displacement signals 
acquired by proximity probes do not provide any additional information for the present 
rotor set up. Also, phase information from the keyphasor was not utilised during this 
study. Therefore, further data processing was carried out only for the four accelerometer 
channels, namely (i) bearing housing vertical acceleration (ii) bearing housing horizontal 
acceleration (iii) bearing housing axial acceleration (iv) foundation vibration. This 
reduction helps in reducing the computational effort considerably. 
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Figure 5.4 Feature Extraction: Damaged Bearing Inner Race (30 Hz, Bearing 
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5.2 Training Vector 

The plots in Figs.5.2(b, d), 5.3(b, d)...5.7(b, d), provide the training vector for the neural 
network. For example, the peaks identified in Fig.5.3(b), which pertain to the vertical 
acceleration at the bearing housing for the mass unbalance fault, can be listed as 


Harmonic Number 

I 

II 

III 

IV 

V 


Peak Amplitude 

15.548 

0 

0 

0 

0 


Similar lists can be obtained from the other three sensors. These lists can be combined to 
form Table 5.1. 


The pattern in column three of Table 5.1 forms the input vector (of size 20) to the neural 
network. The network can be trained to identify this pattern with a binary pattern of the 
type [0 1 0 0 0 0]. This binary pattern forms the target vector associated with mass 
unbalance fault. 

Table 5.2 gives typical training vectors for the six fault configurations under 
consideration, along with the target vectors. It is to be noted that while the target vector 
for a particular fault remains fixed, the training vector vary depending on the variations in 
the individual signals. These variations can be noted from the data given in Table 5.3, 
which gives ten training vectors for the cocked rotor fault. The variation in training 
vectors is highlighted in Fig. 5.8(a-c). 
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Table 5.1 


Typical T raining V ector 


Sensor Number 

Harmojjp^ jg Number 

Peak Amplitude 

1 ■ Acceleration Vertical 

1 

15.548 

2 

0 

3 

0 

4 

0 

5 

0 

2. Acceleration Horizontal 

1 

15.873 

2 

4.942 

3 

4.057 

4 

8.235 

5 

0 

3. Acceleration Axial 

1 

6.479 

2 

4.545 

3 

3.015 


-4 

0 


3 

0 


1 

13.591 


2 

3.682 

4. Acceleration Foundation 

3 

0 


4 

0 


5 

0 


k: z''' i 




r-' -sTffTU 
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Table 5.2 Sample Training and Target Vector Sets 
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Table 5.3 Typical Training Vectors (for Cocked Rotor Fault) 
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Ampiftutie Amplitude 



Training Vector Number 


(®) First Harmonics: Horizontal Acceleration 



41 61 81 
Training Vector Number 

Second Harmonics: Horizontal Acceleration 



41 61 

Training Vector Number 


Third Harmonics: Horizontal Acceleration 


Figure 5.8 Variation in Training Vectors for Three Harmonics of a Single 
Channel Same Fault (Cocked Rotor) 
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5.3 Back-Propagation Network Training 

Various types of back-propagation architectures were investigated for their efficiency 
during training and in prediction of faults. These networks were developed using the 
Neural Network toolbox in MATLAB. In all these architectures the input and output 
vector sizes were kept constant at twenty and six, respectively. The number of hidden 
layers and their sizes were varied. The following architectures were tried (nomenclature 
20 -h/-h2-h5....- hie- 6 etc., where hk denotes the number of neurons in the Mi hidden 
layer): 


(i) 

20-8-6 

(ii) 

20-10-6 

(iii) 

20-12-6 

(iv) 

20-17-6 

(V) 

20-15-9-6 

(Vi) 

20-17-8-6 

(vii) 

20-20-8-6 


Architectures (i)-(iv) involve one hidden layer, while (v) and (vii) involve two hidden 
layers. Training was carried out for each of the above architectures using the same set of 
600 training vectors. Figs. 5.9 (a-g) show the convergence patterns of these networks 
(Fig. 5.9(h) refers to convergence of network (vii) above using TANSIG transfer 
function). The convergence results have also been summarised in Table 5.4. The 
minimum achievable mean square error in architecture (i), 20-8-6, is 2.7778e-4. This is 
achieved after 323 iterations. Increase in the number of neurons in the hidden layer 
initially results in better performance (architectures (ii), 20-10-6 and (iii), 20-12-6), 
whereby lesser value of the error is achieved, through lesser number of iterations. 
Increase in the number of neurons in the hidden layer to 17, results in over-training and 
yields increased error and slower convergence. It can be seen that error can been further 
reduced by increasing the number of hidden layers (architectures (v)-(vii)). The minimum 
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(a) Architecture 20-8- 6 
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(b) Architecture 20-10-6 


Figure 5.9 Convergence Pattern of Various Back-propagation 

Network Architectures 




Mean Square Error 




Figure 5.9 (Contd.) Convei^ence Pattern of Various Back-propagation 

Network Architectures 
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Mean Square Error 



(e) Architecture 20-17-6 



Figure 5.9 (Contd.) Convergence Pattern of Various Back-propagation 
Network Architectures 
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Figure 5.9 (Contd.) Convergence Pattern of Various Back-propagation 
Network Architectures 
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Table 5.5 Results of Backpropagation Architecture, 20-20-8-6, 
Transfer Function: LOGSIG, Trainig Algorithm: TRAINRP 


Fault 

Target 

Sample Number 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
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1 
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1.0000 
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Table 5.6 Performance 



Architectures 

Faults 



20-12-6 

20-17-6 




No Fault 

100% 

100% 

100% 

100% 

100% 

100% 


Mass 

Unbalance 

100% 

100% 

100% 

100% 

100% 

100% 


Bearing Inner 
Race 

100% 

100% 

100% 

90% 

100% 

90% 

100% 

Ball Spin in 
Bearing 

100% 

100% 

100% 

100% 

100% 

100% 

100% 

Bearing 

Outer Race 

70% 

100% 

70% 

100% 

100% 

100% 

100% 

Cocked Rotor 

100% 

100% 

100% 

100% 

100% 

100% 

100% 

Total 

95% 

100% 


98.33% 

100% 

98.33% 

iiiimgi 



Figure 5.11 Correct Fault Classification (in %) by Various Architectures 
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LOGSIG (Fig. 3.11(d)) was used as the Transfer Function in the network architectures 
and TRAINRP was the back-propagation training scheme. Investigations were carried out 
with other transfer functions (like TANSIG, PURELIN etc.) and training schemes 
(TRAINGD, TRAINGDA, TRAINGDX, TRAINLM) also. Though all the results are not 
presented here, a comparison between the convergence pattern for 20-20-8-6 architecture 
with LOGSIG and TANSIG transfer functions is given in Figs. 5.4(g) and 5.4(h). 

5.4 Probabilistic Neural Network Training 

A limited study was carried out with Probabilistic Neural Networks also. In a manner 
similar to that in the case of back-propagation algorithm, six hundred input vectors were 
used to train the network, while sixty vectors were set aside for testing the network. The 
Probabilistic Network being a go-nogo kind of gate, gave correct results in all sixty cases. 
This fact is shown in Fig. 5.12. A major limitation however, of the probabilistic network 
remains that its dimensions keep increasing with each incoming training vector 



Figure 5.12 Results of Fault Classification by Probabilistic Neural Network 
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CHAPTER 6 


CONCLUSIONS AND SCOPE FOR FUTURE WORK 


Xhe objective of this was to carry out multi-point instrumentation on a Fault Simulator 
Rig and integrate the hardware and software of Virtual Instrumentation with techniques 
of Artificial Neural Networks for typical fault identification problems in rotordynamics. 

It was observed that Virtual Instrumentation technology provides efficient multi-charmel 
display and signal processing capabilities. The hardware was installed in a mutually 
compatible manner and vibration signals could be acquired from seven different locations 
on the rotor rig. 

Algorithms were developed for on-line Fast Fourier Transformation of the incoming 
vibration data and automated peak detection from these Fourier plots. These algorithms 
provided the input data for the Artificial Neural Networks which were based on back- 
propagation and probabilistic concepts. Both procedures gave excellent results which 
established the propriety of usage of ANNs for condition monitoring activities in 

rotordynamics. 


However no attempt has been made in this work on quantification of the fault, once it is 
identified (e.g. estimate of the amount of unbalance, if the fault is identified as 
imbalance). The focus, presently, was to generate data for healthy and faulty rotor 
systems and develop a preliminary Neural Network diagnosis frame. It has been found 
that the testing success in addition to the input and hidden layer architecture. 

Further work needs to be carried out to make the ANN shell more realistic by 
incoiporating more nxunber of faults as well as more than one fault in combination with 
one another. Also investigations have to be conducted to quantify the fault, once the 
qualitative identification has been made. The present study also does not address the 
issue, where the faults are in a transitory stage of development. 
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